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ABSTRACT 

In targeted online advertising, advertisers look for maximiz¬ 
ing campaign performance under delivery constraint within 
budget schedule. Most of the advertisers typically prefer 
to impose the delivery constraint to spend budget smoothly 
over the time in order to reach a wider range of audiences 
and have a sustainable impact. Since lots of impressions are 
traded through public auctions for online advertising today, 
the liquidity makes price elasticity and bid landscape be¬ 
tween demand and supply change quite dynamically. There¬ 
fore, it is challenging to perform smooth pacing control and 
maximize campaign performance simultaneously. In this pa¬ 
per, we propose a smart pacing approach in which the deliv¬ 
ery pace of each campaign is learned from both offline and 
online data to achieve smooth delivery and optimal perfor¬ 
mance goals. The implementation of the proposed approach 
in a real DSP system is also presented. Experimental evalu¬ 
ations on both real online ad campaigns and offline simula¬ 
tions show that our approach can effectively improve cam¬ 
paign performance and achieve delivery goals. 

Categories and Subject Descriptors 

H. l.O [Information Systems]: Models and Principles— 
General', D.2.8 [Software Engineering]: Metrics— Perfor¬ 
mance Measures 

Keywords 

Campaign Optimization;Demand-Side Platform;Budget Pac¬ 
ing 

I. INTRODUCTION 

Online advertising is a multi-billion dollar industry and 
has been enjoying continued double-digit growth in recent 
years. The market has witnessed the emergence of search 
advertising, contextual advertising, guaranteed display ad¬ 
vertising, and more recently auction-based advertising ex¬ 
changes. We focus on the auction-based advertising ex¬ 
changes, which is a marketplace with the highest liquid- 
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ity, i.e., each ad impression is traded with a different price 
through a public auction. In this market, Demand-Side Plat¬ 
forms (DSPs) are key players who act as the agents for a 
number of different advertisers and manage the overall wel¬ 
fare of the ad campaigns through many direct buying ad- 
networks or real-time bidding (RTB) ad exchanges in order 
to acquire different ad impressions. The objectives of an 
advertiser on a DSP can be summarized as follows: 

• Reach the delivery and performance goals: for brand¬ 
ing campaigns, the objective is usually to spend out 
the budget to reach an extensive audience and mean¬ 
while make campaign performance as good as possible; 
for performance campaigns, the objective is usually to 
meet the performance goal (e.g. eCPC^no more than 
$2) and meanwhile spend as much budget as possible. 
Objectives of other campaigns are usually in-between 
these two extremes. 

• Execute the budget spending plan: advertisers usually 
expect their ads to be shown smoothly throughout the 
purchased period in order to reach a wider range of 
audience, have a sustainable impact, and increase syn¬ 
ergy with campaigns on other medias such as TV and 
magazines. Therefore, advertisers may have their cus¬ 
tomized budget spending plans. Figure gives two 
examples of budget spending plan: even pacing and 
trafhc based pacing. 

• Reduce creative serving cost: apart from the cost to 
be charged by DSPs, there is also creative serving cost 
charged by typically 3rd-party creative server providers. 
This is even more important nowadays that more and 
more ad campaigns are in the form of video or rich 
media. The creative serving cost of such type of im¬ 
pressions can be as much as premium inventory cost, 
so the advertisers will always be willing to reduce this 
cost and deliver impressions to the right users effec¬ 
tively and efhciently. 

However, it becomes more and more challenging to achieve 
all the above objectives simultaneously. In the individual 
campaign level, each campaign may have its own budget, 
budget spending plan, targeted audiences, performance goal, 
billing method, creative serving cost, and so on. In the 
network level, an increasing number of DSPs compete with 
each other simultaneously to acquire inventory through pub¬ 
lic auctions in many ad exchanges, and therefore the price 
elasticity and bid landscape between demand and supply 


^Effective cost per click - The cost of a campaign divided by the total 
number of clicks received. Effective cost per action (eCPA) is defined 
similarly. 





Figure 1: Different budget spending plans. 


change dynamically. All those varieties in both campaign 
and network level make the optimization extremely difficult 
even for a single campaign. 

Fortunately, thanks to the rapid growth of emerging in¬ 
ternet industries such as mobile apps and user-generated 
content platforms, the online advertising industry has ob¬ 
served a sharply increasing availability of advertising inven¬ 
tory. A DSP nowadays typically receives tens of billions of 
ad requests from dozens of Supply-Side Platforms (SSPs) ev¬ 
eryday and hence has more flexibility than before to spend 
budget on a vast amount of advertising opportunities. In 
this paper, we consider a problem motivated by these trends 
in the online advertising industry: can we smartly decide on 
which inventories the budget should be spent so that all the 
objectives of an ad campaign are achieved? We study this 
problem with a real DSP and explore models and algorithms 
to effectively serve ad campaigns it manages. Our contribu¬ 
tions can be summarized as follows: 

• We formulate the above multi-objective optimization 
problem with a comprehensive anatomy of real ad cam¬ 
paigns and propose to solve it through smart pacing 
control. 

• We develop a control-based method which learns from 
both online and offline data in order to optimize budget 
pacing and campaign performance simultaneously. 

• We implement the proposed approach in a real DSP 
and conduct extensive online/offline experimental eval¬ 
uations. The results show that our approach can ef¬ 
fectively improve campaign performance and achieve 
delivery goals. 

The rest of the paper is organized as follows: we review 
the related work in section 2. fn section 3, we present the 
notations and formal problem statement. We describe our 
models and algorithms in section 4 and 5. Section 6 and 7 
focus on experiments and implementation respectively. We 
conclude our work and discuss future work in section 8. 


2. RELATED WORK 

Most existing work related to campaign optimization fo¬ 
cuses on estimating the click through rate (CTR)/action 
rate (AR) or bid landscape, which helps to setup the bid 
price in the impression level. For a comprehensive survey 
of all those methods, please refer to However, those ap¬ 
proaches do not take into account of the smooth delivery 
constraint. 

Another research direction deals with the ad allocation 
problem. Chen et al. and Bhalgat et al. proposed 
to perform online ad allocation and solve the optimization 
problem for all campaigns in the marketplace level. Their 
focus is to find the best allocation plan to optimize revenue 




(a) Probabilistic throttling 


(b) Bid modification 


Figure 2: Factor dependency graph in Probabilistic 
Throttling v.s. Bid Modification. Factors in grey 
are involved in budget pacing control. Adding de¬ 
pendency between pacing rate and response rate is 
one of the key ideas of our work. 


on the publisher side while we focus on maximizing the de¬ 
livery and performance for each campaign. Bhalgat et al. 

proposed a different approach to perform ad allocation 
based on the inventory quality, which shares some common 
thoughts with our work. We both assume that the inventory 
is much larger than advertiser demand and therefore both of 
our solutions focus on how to select high quality inventory. 
However, their approach focuses on the allocation problem 
and the smooth delivery constraint does not appear in the 
formulation. Zhang et al. 13 proposed to combine bud¬ 
get allocation and bid optimization in a joint manner. We 
claim that the budget allocation can be complementary to 
our work in the campaign level. After the budget allocation 
is done, our approach of budget pacing can take place to 
further optimize the campaign goals in the impression level. 

Mehta et al. [^, Abrams et al. [^, and Borgs et al. 
suggested using bid modification while Agarwal et al. 
and Lee et al.[^ used probabilistic throttling to achieve 
pacing control, fn this paper, we use probabilistic throt¬ 
tling for several considerations: 1) Probabilistic throttling 
directly influences budget spending while bid modification 
changes the win-rate to control spending, which is not pre¬ 
ferred in the RTB environment. First, the bid win-rate curve 
is usually not smooth so modifying bid can cause significant 
changes in budget spending. Second, our observation on 
real serving data is that the bid landscape can be changed 
dramatically over time. Both issues make pacing control ex¬ 
tremely difficult through bid modification. 2) SSPs usually 
set reserve prices so pacing control may fail if the bid need 
to be modified to be below the reserve price [^ . 3) As shown 
in Figure throttling with a pacing rate decouples pacing 
control from bid calculation. This is an appealing feature 
because the pacing control can be developed independently 
and combined with any bid optimization implementation. 
In the rest of this paper, we assume the bid is already given 
by a preceding bid optimization module. 


3. PROBLEM FORMULATION 

We focus on two most prevailing campaign types: 1) 
branding campaigns that aim at spending out the budget 
to have an extensive reach of audiences, and 2) performance 
campaigns that have specific performance goals (e.g. eCPC 






















< $2). Other types of campaigns typically lie in-between 
these two extremes. A campaign of either type may have 
its unique budget spending plan. We first formulate the 
problem to be resolved and then outline our solution. 


3.1 Preliminaries 

Let Ad be an ad campaign, B be the budget of Ad, and G 
be the performance goal of Ad if there is. A spending plan is 
a sequence of budgets over a number K of time slots, speci¬ 
fying the desired amount of budget to be spent in each time 
slot. We denote by B = the spending plan 

of Ad, where 5*-^^ > 0 and X]t=i k ~ Reqi be 

the i-th ad request received by a DSP. As we discussed in 
section we use probabilistic throttling for budget pacing 
control in our work. Thus we denote by: 

Si ~ Bern(ri) the variable indicating whether Ad partic¬ 
ipates the auction for Reqi, where r; is the point pacing rate 
of Ad on Reqi. € [0,1] quantifies the probability that Ad 
participates the auction for Reqi. 

Wi the variable indicating whether Ad wins Reqi if it par¬ 
ticipates the auction, which depends on the bid bidi given 
by the bid optimization module. 

Ci the advertiser’s cost if Ad is served to ad request Reqi. 
We note that the cost consists of both the inventory cost 
and the creative serving cost, 

qi ~ Bern{pi) the variable indicating whether the user 
performs some desired response (e.g. click) if Ad is served 
to Reqi, where Pi = Pr(respond\Reqi, Ad) is the probability 
of such response. 

C = Si X Wi X Ci the total cost of ad campaign Ad. 

P — CI SiXWiX qi the performance (e.g. eCPC if the 

desired response is click) of ad campaign Ad. 

C = {C^^\ ... the spending pattern over the K 

time slots, where is the cost in the t-th time slot, > 
0 and = C- 

Given an ad campaign Ad , we dehne Q. to be the penalty 
(error) function that captures how the spending pattern C 
is deviated from the spending plan B. A smaller value will 
indicate a better alignment. As an example, we may define 
the penalty as follows: 


ff(C,B) 




^^(CW-BW)2 


( 1 ) 


3.2 The Problem of Smart Pacing for Online 
Ad Campaign Optimization 

Advertisers look for spending budget, executing spending 
plan, and optimizing campaign performance simultaneously. 
However, there may be multiple Pareto optimal solutions to 
such an abstract multi-objective optimization problem. In 
real scenarios, advertisers usually prioritize these objectives 
for different campaigns. For branding campaigns, advertis¬ 
ers typically put budget spending at the top priority, fol¬ 
lowed by aligning with spending plan while performance is 
not a serious concern. At serving time (i.e. the ad request 
time), since we use probabilistic throttling, the only thing 
that we can have full control is r^. Thus the problem of 
smart pacing for ad campaigns without specific per¬ 
formance goals is defined as determining the values of 


so that the following measurement is optimizecj^ 
min P 

Vi 

( 2 ) 

s.t. C = B, H(C,B) < e 

where e defines the tolerance level on deviating from the 
spending plan. On the contrary, for performance campaigns 
that have specific performance goals, achieving the perfor¬ 
mance goal is the top priority. Sticking to the spending plan 
is usually the least important consideration. We define the 
problem of smart pacing for ad campaigns with specific 
performance goals as determining the values of ri so that 
the following measurement is optimized: 

min H(C,B) 

r,- 

(3) 

s.t. P < G, B — G < s 

where e defines the tolerance level for not spending out 
all the budget. Given the dynamics of the marketplace, 
even both the single-objective optimization problems are 
extremely difficult to resolve. Existing methods that are 
widely used in the industry deal only with capturing either 
the performance goal or the budget spending goal. One 
example for achieving performance goal is always bidding 
the retargeting beacon triggered ad requests. Unfortunately, 
there is no guarantee to avoid overspending or underspend¬ 
ing. Another example for smooth pacing control is introduc¬ 
ing a global pacing rate so that all ad requests have the same 
probability to be bid by a campaign. However, none of these 
existing approaches can solve the smart pacing problem we 
formulate here. To tackle this problem, we examine the pre¬ 
vailing campaign setups and make some key observations 
that motivated our solution: 

• CPM campaigns: advertisers are charged a fixed amount 
of money for each impression. For branding advertis¬ 
ers, the campaign optimization is as dehned in Equa¬ 
tion As long as budget can be spent and spending 
pattern is aligned as the plan, high responding ad re¬ 
quests should have a higher point pacing rate than low 
responding ones so that the performance can be opti¬ 
mized. For performance advertisers (i.e. with eCPG, 
eGPA goal), the campaign optimization is as defined in 
Equation!^ Apparently, high responding ad requests 
should have higher point pacing rate to achieve the 
performance goal. 

• CPC/CPA campaigns: advertisers are charged based 
on the sheer number of clicks/actions. There is im¬ 
plicit performance goal to guarantee that DSP does 
not lose money when bidding on behalf of the advertis¬ 
ers. So it falls in the category of optimization dehned 
in Equation]^ Granting high responding ad requests 
high point pacing rates will be more effective from both 
the advertisers’ and DSPs’ perspectives: advertisers 
pay less on creative serving cost while DSP can save 
more ad opportunities to serve other campaigns. 

• Dynamic CPM campaigns: DSP charges a dynamic 
amount of money for each impression instead of a Hxed 
amount. These campaigns usually have specihc per¬ 
formance goals so the optimization problem falls in 
Equation]^ Similar with GPG/CPA campaigns, high 

2 

Based on our performance definition (i.e. eCPC or eCPA) a smaller 
value means a better performance. 





responding ad requests are more preferred in order to 
reduce creative serving cost and save ad opportunities. 

3.3 Solution Summary 

Motivated by these observations, we develop novel heuris¬ 
tics to solve the smart pacing problem. The heuristics try to 
find a feasible solution that satisfies all constraints as defined 
in Equation]^ orand then further optimize the objectives 
through feedback control. We first learn from offline serv¬ 
ing logs to build a response prediction model to estimate 
Pi = Pr(respond\Reqi, Ad), which helps distinguish high re¬ 
sponding ad requests from low responding ones. Second, 
we reduce the solution space by grouping similarly respond¬ 
ing ad requests together and the requests in the same group 
share the same group pacing rate. Groups with high respond¬ 
ing rates will enjoy high pacing rates (refer to the blue ar¬ 
row in Figure a)). Third, we develop a novel control-based 
method to learn from online feedback data and dynamically 
adjust the group pacing rates to approximate the optimal 
the solution. Without loss of generality, we assume cam¬ 
paign setup is CPM billing with or without an eCPC goal. 
Our approach can be applied to other billing methods and 
performance types as well as other grouping strategies such 
as grouping based on Pi/ci (the expected response per cost). 

4. RESPONSE PREDICTION 


for campaign optimization. In other words, the group pac¬ 
ing rates are dynamically adjusted throughout the campaign 
life time. For simplicity, pacing rate and group pacing rate 
are interchangeable in the rest of this paper, and we denote 
by ri the group pacing rate of the f-th group. 

5.1 A Layered Presentation 

For each ad campaign, we maintain a layered data struc¬ 
ture in which each layer corresponds to an ad request group. 
We keep the following information of each ad request group 
in the layered structure: average response rate (usually in 
the form of CTR, AR, etc) derived from the response pre¬ 
diction model; priority of the ad request group; pacing rate 
i.e. the probability to bid an ad request in the ad request 
group; and the campaign’s spending on the ad request group 
in the latest time slot. The principles here are: 1) layers cor¬ 
respond to high responding ad request groups should enjoy 
high priorities, and 2) the pacing rate of a high priority layer 
should not be smaller than that of a low priority layer. 

For each campaign, when the DSP receives an eligible 
ad request, it first decides which ad request group the ad 
request falls in and refers to the corresponding layer to ac¬ 
quire the pacing rate. The DSP then bids the ad request on 
behalf of the campaign with a probability that equals to the 
retrieved pacing rate at the price given by a preceding bid 
optimization module. 


Our solution depends on an accurate response prediction 
model to estimate pi. There are plenty of work in the litera¬ 
ture addressing this problem as we have reviewed in section 
Here we briefly describe how we perform this estimation. 
We use the methodology introduced in [2 11 and make some 
improvements on top of that. In this method, we first lever¬ 
age the hierarchy structures in the data to collect response 
feedback features at different granularities. For example, at 
ad side, starting from the root and continuing layer after 
layer are advertiser category, advertiser, campaign, and fi¬ 
nally ad. Historical response rates at different levels in the 
hierarchy structures are used as features to derive a machine 
learning model (e.g. LR, GBDT, etc) to give a raw estima¬ 
tion of Pi, say Pi. Then we utilize attributes such as user’s 
age, gender to build a shallow tree. Each leaf node of the 
tree identihes a disjoint set of ad requests which could hardly 
be further split into subsets with significantly different av¬ 
erage response rates. Finally, we calibrate Pi within the leaf 
node Reqi is classified using a piecewise linear regression to 
estimate the final pi. This scheme results in a fairly accurate 
response prediction. 


5. A CONTROL-BASED SOLUTION 

As we have discussed in section it is extremely difficult 
to reach the exact optimal solution to the problems defined 
in equations and in an online environment. We explore 
heuristics to reduce the solution space of the original prob¬ 
lems. More specifically, with the response prediction model 
described in section]^ similarly responding ad requests are 
grouped together and they share the same group pacing rate. 
Different groups will have different group pacing rates to re¬ 
flect our preferences on high responding ad request groups. 
The original problem of solving the point pacing rate of each 
Vi is reduced to solving a set of group pacing rates. We em¬ 
ploy a control-based method to tune the group pacing rates 
so that online feedback data can be leveraged immediately 


5.2 Online Pacing Rate Adjustment 

We employ a control-based method to adjust the pacing 
rate of each layer based on real-time feedbacks. Suppose 
we have L layers, the response rate estimation by response 
prediction model for each layer is p = (pi,... ,p_l), and 
hence if the desired response is click, the estimated eGPG 
of each layer is e = (ei,...,ei) where d = Let 

the pacing rate of each layer in the (t — l)-th time slot be 

and the spending of each layer 
be ... ,c^l~^'^), the control-based method 

will derive for the coming t-th time 

slot based on campaign objectives. 

5.2.1 Campaigns without Performance Goals 

We first describe the adjustment algorithm for ad cam¬ 
paigns without specific performance goals. Recall that for 
such campaign type, the primary goal is to spend out the 
budget and align with the budget spending plan. Thus at 
the end of each time slot, the algorithm needs to decide the 
amount of budget to be spent in the next time slot and ad¬ 
just the layered pacing rates to spend exact that amount. 

The budget to be spent in the next time slot is determined 
based on the current budget spending status. Given an ad 
campaign, suppose its total budget is B, budget spending 
plan is B = ...,B^^'^), and after running for m time 

slots, the remaining budget becomes Bm. We need to de¬ 
cide the desired spending in each of the remaining time slots, 
denoted as so that the total budget can be 

spent out and the penalty is minimized. 

arg min D 

(4) 

s.t. E 

t = m + l 





Algorithm 1 AdjustWithoutPerformanceGoal 

Input: R 

Output: 


1 : 

2 : 

3: 

4: 

5: 

6 : 

7: 

8 : 

9: 

10 : 

11 : 

12 : 

13: 

14: 

15: 

16: 

17: 

18: 

19: 


if R =— 0 then 

return 

else if i? > 0 then 

for each layer I in (L, ... ,l ) do 


(*) 

T; = mini 


R = R-ct"^ X 


'I 


end for 

^ = trial rate if 1 7 ^ 1 and > trial rate 

else 

for each layer Hn (Z ,..., L) do 


(t) 

ri = max 


(0.0, X 








(t 11 

R = R-ct"^ X 

if i? > 0 then 

= trial rate \f I ^ 1 and r[^'^ > trial rate 

break 
end if 
end for 
end if 


return r^*^ = (r-*^ 




1 : ■ • ■ 1 ' L 


in which if we adopt the definition of Q as in equation we 
have the following optimal solution: 


C 


(^) _^ 0 ) 


Br 


- B^^'> 

Z^t=m + l ^ 


K — m 


(5) 


where t = m + 1,..., A. We omit the details of how 
is derived because of page limit. In an online environment, 
suppose the actual spending in the latest time slot is 
we define R = to be the residual which can 

help us to make adjustment decisions. 

Algorithm gives the details of how the adjustment is 
done. Suppose index L represents the highest priority, index 
1 refers to the lowest priority, and let I be the last layer with 
non-zero pacing rate (the principles in section 5.1 guarantee 


the existence of Z ). If i? equals 0, no adjustment is needed. 
If i? > 0, which means delivery should speed up, pacing rates 
are adjusted in a top-down fashion. Starting from layer L, 
the pacing rate of each layer is increased one-by-one until 
layer Z . Line 5 calculates the desired pacing rate of the 
current layer in order to offset R. We give layer Z — 1 a 
trial rate to prepare for future speedups if layer Z 7 Z: 1 and 
its updated pacing rate r^P > trial rate. Figure 3 gives an 
example of how the speedup adjustment is done. If R < 0, 
which means delivery should slow-down, pacing rate of each 
layer is decreased in a bottom-up fashion until R is offset. 
Line 11 derives the desired pacing rate of the current layer 
to offset R. Suppose I is the last layer adjusted, Z ^ 1 and its 
new pacing rate rp^ > trial rate, we give layer Z — 1 the trial 
rate to prepare for future speedups. Figure]^ is an example 
how delivery is slowed down. 

We note that this greedy strategy tries to approach the 
optimal solution to Equation in an online environment. 
Within each time slot, it strives to invest on inventories with 


spending in the last time budget to spend in next 

slot: $2,800 time slot: $5,500 


response_rate:0.005 
priority: 3 
pacing_rate: 1.0 
spending: $1,000 


response_rate:0.005 
priority; 3 
pacing_rate; 1.0 
expected_spending: $1,000 

response rate: 0.003 

1- \ 

response rate: 0.003 

priority: 2 

1 -1.2 

priority; 2 

pacing rate: 0.5 

speed up 

pacing rate; 1.0 

spending: $1,500 


expected_spending: $3,000 

response rate: 0.0008 


response rate: 0.0008 

priority: 1 


priority; 1 

pacing rate: 0.001 (trial rate) 


pacing rate; 0.005 

spending: $300 


expected_spending: $1,500 


Figure 3: An example to speed up budget spending 


Spending in the last time budget to spend in next 

slot: $2,800 time slot: $900 


response rate:0.005 


response rate:0.005 

priority: 3 


priority: 3 

pacing rate: 1.0 


pacing rate: 0.9 

spending; $1,000 


expected_spending: $900 

response_rate: 0.003 
priority: 2 
pacing_rate: 0.5 
spending; $1,500 

slow down 

response_rate: 0.003 
priority; 2 

pacing_rate; 0.001 {trial_rate) 

response_rate: 0.0008 
priority: 1 

pacing_rate: 0.001 (trial_rate) 
spending; $300 


response_rate: 0.0008 
priority; 1 
pacing_rate: 0 


Figure 4: An example to slowdown budget spending 


the best performance under the total budget and spending 
plan constraints. 


5 . 2.2 Campaigns with Performance Goals 
For campaigns with specific performance goals (e.g. eCPC 
< $2), pacing rate adjustment is a bit complicated. It is dif¬ 
ficult to foresee the ad request traffic in all the future time 
slots and the response rate distribution can be time-varying. 
Therefore, given budget spending objectives, exploiting all 
the ad requests in current time slot that meet performance 
goal may not be an optimal solution to Equation 0 Algo¬ 
rithm [^describes how the adjustment is done for such kind 
of campaigns. We adopt the heuristic that a further adjust¬ 
ment based on performance goal is appended to Algorithm 
If the expected performance after Algorithm does not 
meet the performance goal, the pacing rates are reduced 
one-by-one from the low priority layers until the expected 
performance meets the goal. Line 7 derives the desired pac¬ 
ing rate of current layer to make the overall expected eCPC 
meet the goal. Function ExpPerf(c^*~^\ e, i) in 

Line 2 and 4 estimates the expected joint eCPC of layers 
i,..., L if pacing rates are adjusted from to , where 
ej is the eCPC of layer j. 


ExpPerf(c^* r*-* 


L 

E 


-i r 


(t-1) 


L 

E- 

j=z 




( 6 ) 


5.3 Number of Layers, Initial and Trial Rates 

It is important to set proper number of layers, initial and 
trial pacing rates. For a new ad campaign without any deliv¬ 
ery data, we identify the most similar existing ad campaigns 
in our DSP and estimate a proper global pacing rate rc at 
which we expect the new campaign can spend out its bud¬ 
get. Then the number of layers is set as L = • We note 

































Algorithm 2 AdjustWithPerformanceGoal 

Input: R, e, goal 

Output: 

1: r*‘^=AdjustWithoutPerformanceGoal(c^*“^\ R) 


2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 


if ExpPerf(c^* > goal then 

for each layer 1 in (1 ,..., L) do 

if ExpPerf(c*-*“^^, r^*\ e, Z + 1 ) > goal then 

rp’ = 0.0 

else 


Ad 








x(^ 


-- 1 ) 


JJ=T1 


y-a-^) 


if Z 7 I 1 then 
rl*2i = trial rate 

end if 
break 
end if 
end for 
end if 

return r^*^ = ..., 


that a moderate number of layers is more desirable than an 
excessive one for two reasons: 1) the delivery statistics of 
each layer is not significant if there are too many layers; 2) 
from the system perspective, excessive number of layers may 
use up bandwidth and/or memory. 

Once the number of layers is determined, we run the cam¬ 
paign at the global pacing rate ra in the first time slot. We 
call this step an initialization phase in which the delivery 
data can be collected. We group equal amount of impres¬ 
sions into the desired number of layers based on their pre¬ 
dicted response rate to identify the layer boundaries. In the 
next time slot, the pacing rate of each layer is reassigned 
based on planned budget in the next time slot and high re¬ 
sponding layers will have rates of 1.0 while low responding 
ones will have rates of 0.0. 

In the adjustment algorithms, the direct successive layer 
next to the layer with non-zero pacing rate is assigned a trial 
pacing rate. The purpose is to collect delivery data in this 
layer and prepare for future speedups. This trial rate is sup¬ 
posed to be quite low. We derive such a rate by reserving 
a certain portion A (e.g. A = 1%) of budget to be spent in 
the next time slot. Let the trial layer be the Z-th layer and 
the budget for next time slot is , recall that we have his¬ 
torical spending and pacing rate of this layer from at least 
one time slot (the initialization phase), the trial pacing rate 
is derived as trial rate = r^*'^ x ^ , where c[*^ and r^*^ 

are the historical spending and pacing rate of the Z-th layer. 

As a quick summary, we employ a layered presentation of 
all the ad requests based on their predicted response rate and 
execute budget pacing control in the layer level to achieve 
delivery and performance goals. Both the spending in the 
current time slot and the remaining budget are considered to 
calculate the layered pacing rates in the next time slot. We 
also tried the alternative to control a threshold so that only 
ad requests with predicted response rate above the threshold 
were bid. The outcome of such alternative, however, was not 
satisfactory. The main reason is that ad requests are usually 
not smoothly distributed over response rate, and therefore it 
is difficult to realize smooth control with a single threshold. 


6. EXPERIMENTAL EVALUATIONS 

We conduct extensive experiments on both real ad cam¬ 
paigns and offline simulations to evaluate the effectiveness 
of our approach. Without further specification, the follow¬ 
ing setting is used throughout the experiments: the baseline 
approach is our approach with only one layer (i.e. using a 
global pacing rate), the timespan and time slot interval are 
24 hours and 15 minutes respectively, the spending plan is 
even pacing, and the initial pacing rate and trial budget frac¬ 
tion in our approach are re = 0.01 and A = 1% respectively. 
We look at all the three aspects discussed throughout this 
paper: 1) performance, 2) budget spending, and 3) spend¬ 
ing pattern. Since the value of the penalty 12 as defined in 
Equationis hard to interpret, we transform it into the fol¬ 
lowing metric: AvgErr = x 12, which quantifies the 

relative deviation of the actual spending from the average 
planned spending per time slot, where K is the number of 
time slots and B is the total budget. 

6.1 Results from Real Campaigns 

We pick up from our DSP system 3 campaigns for online 
A/B test and another 4 campaigns for online over-time test. 
All these 7 campaigns are GPM campaigns and two of them 
have specific eGPG goals. 

In the A/B test, we deploy 8 layers of group pacing rates in 
our approach. Gampaign 1, 3 are without performance goals 
while campaign 2 has an eCPC goal of $2.5. Figurej^shows 
the test result for these campaigns. Our approach is sur¬ 
prisingly effective in boosting the performance. Gompared 
to the baseline, the eGPG reductions are —72%, —67%, and 
—79% for the three campaigns respectively. We note that 
for Gampaign 2, the baseline solution fails to meet the eGPG 
goal. On the other hand, the total spendings and spending 
patterns of our approach are as good as the baseline. The 
AvgErr comparisons of the baseline and our approach are 
6.4% : 6.8%, 9.1% : 9.2%, and 10.2% : 9.8% for the three 
campaigns respectively. Gampaign 3 has experienced spend¬ 
ing fluctuations and we found there was another competing 
campaign went offline at time slot 25 so the win-rate of Gam¬ 
paign 3 surged. Our algorithm can recover very soon from 
the environment changes and continue to deliver smoothly. 

The result of over-time test is summarized in Table [T] In 
this test, we run 4 campaigns for 2 weeks with baseline so¬ 
lution in the first week and our approach with 3 layers in 
the second week. Among the 4 campaigns, only Gampaign 
4 has a specific eGPG goal of $7. Our approach success¬ 
fully reduces the eGPG of all the 4 campaigns. The most 
significant reduction almost comes to —50%. Our approach 
does not show significant eGPG reduction on Gampaign 7 
because its average pacing rate AvgPR is already around 
0.6 and there is not much room to improve its performance. 
We note that the baseline fails to achieve the eGPG goal of 
Campaign 4. In real applications, compromising the per¬ 
formance goal may permanently lose the advertiser, which 
should always be avoided. From the delivery perspective, 
both the baseline and our approach manage to spend out 
the total budget smoothly with less than 13.9% deviation 
from the spending plan. 

6.2 Offline Simulations 

We also conduct extensive offline simulations to further 
assess the effectiveness of our approach. We randomly se¬ 
lect from our demand pool an ad campaign to generate the 
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Figure 5: Online A/B test result of 3 real campaigns running on our DSP system (# layers: 8). 


Campaign Setup 

Baseline (1st week) 

Smart Pacing (2nd week) 


ID 

Budget 

CPM 

eCPC goal 

Spending 

Omega 

AvgErr 

eCPC 

AvgPR 

Spending 

Omega 

AvgErr 

eCPC 

AvgPR 

eCPC reduction 

4 

S35K 

$3.10 

$7.00 

$35.2K 

20.43 

5.6% 

$10.20 

0.0024 

$35.3K 

20.72 

5.7% 

$5.93 

0.0026 

-41.9% 

5 

S20K 

$3.00 

No goal 

$20.4K 

12.21 

5.9% 

$14.78 

0.019 

$20.2K 

12.32 

5.9% 

$7.51 

0.017 

-49.19% 

6 

$2K 

$3.35 

No goal 

$1.97K 

2.89 

13.9% 

$10.56 

0.017 

$2.OIK 

2.79 

13.4% 

$8.61 

0.015 

-18.47% 

7 

S7.5K 

$3.00 

No goal 

$7.37K 

9.08 

11.6% 

$9.90 

0.59 

$7.58K 

9.19 

11.8% 

$9.02 

0.65 

-8.89% 


Table 1: Online over-time test result of 4 real campaigns (# layers: 3). 




Predicted CTR 


(b) Ad request and win-rate distribution over predicted CTR 


Figure 6: Offline simulation setup: simulation data is collected from real serving logs of the DSP. 


simulation data. First, the eligible ad request distribution 
over 24 hours is collected (Figurea)). Second, the traffic 
as well as the win-rate distribution over predicted CTR is 
collected (Figure [^b)). We further assume the total num¬ 
ber of ad requests is 10,000,000 and billing CPM is $5. The 
campaign may or may not have a specific eCPC goal. 

We are interested in how our approach performs when 
compared with the state-of-the-art budget pacing method 
proposed by Agarwal et al [^. In their method, a global pac¬ 
ing rate is dynamically adjusted by ±10% every one minute 
to make the cumulative spending align with the allocated 
budget. The budget allocation is based on forecasted cu¬ 
mulative traffic. In our experiment, the traffic pattern in 
Figure |^a) is further granulized into one minute per time 
slot, and we use the average traffic of previous seven days as 
the forecasted traffic to determine budget allocation. Fig¬ 
ure l^a) shows the cumulative traffic and allocated budget 
curves. Please note that the budget spending plan here is 
not even pacing. Since their method does not consider per¬ 
formance, no eCPC goal is set. We use 8 layers of pacing 
rates in our approach. From the result shown in Figurej^b), 


both approaches successfully align the cumulative spending 
curves to the allocation curve. A subtle difference is that 
their spending curve has fluctuations around the allocation 
curve. If we look at the AvgErr which captures the relative 
deviation in the per time slot level, their approach gets a 
surprisingly high error of 96% while our approach generates 
a much smaller error of 18%, which means our approach 
can produce smoother pacing result even with very granular 
time slots. From the control theory perspective, their ap¬ 
proach depends on a conventional feedback controller while 
our approach essentially uses an adaptive controller whose 
parameters are adaptive to the spendings in the current time 
slot and planned budget in the next time slot. In the RTB 
environment, where the plant is complex and time-varying, 
an adaptive controller usually performs better than a con¬ 
ventional feedback controller [^. Moreover, since our ap¬ 
proach models the quality of the ad requests, it achieves a 
—70% lower eCPC compared to their approach. 

We continue to further study the behaviors of our method. 
When no eCPC goal is specified, we first fix the campaign 
budget at $2,000 and vary the number of layers. As we can 
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(b) Actual spendings and eCPCs comparison (eCPC is calculated every 
15 minutes to collect sufficient number of clicks) 


Figure 7: Comparison with state-of-the-art approach with arbitrary spending plan 


observe from Figure |^a), the top objective to spend out 
budget as much and as smooth as possible are all achieved 
except when the number of layers is extremely large (i.e. 
^ layers = 256). The reason is that an excessive number 
of layers delays the layer-by-layer adjustment to adapt to 
traffic changes. The difference of eCPCs as shown in Fig¬ 
ure [^c) speaks of the advantage of our approach. When we 
increase the number of layers, we have more discernability 
and flexibility to cherry-pick high performing ad requests - 
especially in those time slots when the ad request volume is 
high. Then we fix the number of layers at 8 and vary the 
total budget. As Figure [^b) shows, when the budget keeps 
going up, there are relatively less available supply so that 
the budget spending pattern will be more influenced by ad 
request volume fluctuations. Another interesting observa¬ 
tion is that the campaign performance is better when the 
budget is less (Figure [^d)). This is because a small budget 
means our approach can cherry-pick the best performing ad 
requests without compromising the budget spending objec¬ 
tives. 

When a specific eCPC goal is specified, keeping eCPC 
below this goal is the most important task. Our simula¬ 
tion results in this scenario are shown in Figure when the 
eCPC goal is set to $0.8. Please note that whenever the pac¬ 
ing rate of every layer is adjusted to zero, which means the 
performance goal cannot be achieved based on our estima¬ 
tion, we reset the pacing rates of all the layers so that only 
the top priority layer has a trial rate. Again, we first fix 
budget at $2,000 and vary the number of layers. Different 
from when no eCPC goal is set, we find that the spendings 
are extremely low when there are only 1 or 2 layers (Figure 
l^a)). Because in such cases, the eCPC goal can never be 
achieved and the pacing rates are kept being reset. It can be 
reconfirmed by looking at Figure j^c), in which only when 
number of layers is 4, 8, or 256 the average eCPC is below 
$0.8. When we fix the number of layers at 8 and vary the 
budget, we also observe different results from when there is 
no eCPC goal, e.g. Figure [Mb) shows quite different spend¬ 
ing pattern from Figure jsfb). The reason is that to achieve 
eCPC goal, sometimes we need to sacrifice budget spending 
objectives - especially when the traffic is low. This is even 
more apparent when the budget increases (Figure[^d)). We 
note that it is desired and is exactly one of our contributions. 


7. SYSTEM IMPLEMENTATION 

In a large scale online ad serving system, there are many 
infrastructure and implementation issues need to be addressed 
such as data consistency, service availability, and fault tol¬ 
erance, that are, however, out of the scope of this paper. 
In this section, we mainly focus on the following challenges. 
rapid feedback of layered statistics: as described in section 

the pacing rate adjustment is mainly based on the lay¬ 
ered delivery statistics. Therefore, how to collect the online 
feedback data efficiently and reliably becomes a major imple¬ 
mentation challenge. Overspending prevention: overspend¬ 
ing should always be avoided since it undermines either the 
advertiser or the DSP’s interest. Thus, a quick stop mecha¬ 
nism is necessary to prevent overspending. 

We address these challenges by implementing a real time 
feedback pipeline as well as an in-memory data source. As 
illustrated in Figure [Toj impression serving boxes receive im- 
pression/click events and produce delivery messages into the 
message queue. The in-memory data source consumes mes¬ 
sages from the message queue and performs aggregations on 
top of these messages. Finally the controller refers to the 
in-memory data source to send quick stop notifications or 
adjusted pacing rates to the bidders. 

7.1 Real Time Feedback 

Traditional ad serving systems log events such as ad re¬ 
quests, bids, impressions, clicks to an offline data warehouse, 
and perform offline processing on top of it. However, the 
delay is too high to fit in our scenario. Therefore, building 
a separate pipeline to provide real time feedback to online 
serving system becomes an essential requirement. 

We implement both message queue and remote procedure 
call (RPC) in our system to transfer messages. Message 
queue is used to send delivery messages from impression 
serving boxes to in-memory data source while RPC is used 
to send pacing rates and quick stop notifications to ad re¬ 
quest bidders. Message queue works in asynchronous mode, 
which means producer and consumer are decoupled and pro¬ 
ducer has no idea whether consumer has consumed the mes¬ 
sage or not. The asynchronous nature makes it easy to 
achieve high throughput and low latency. RPC works in 
synchronous mode, which means caller will get success or 
failure responses from callee. Therefore, the motivation to 
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(a) Spendings over time with different ^ of layers (Budget:$2,000. Budget 
spending plan is compromised when ^ of layers is excessive.) 
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(b) Spendings over time with different budget of layers: 8 ) 
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(c) eCPC over time with different ^ of layers (Budget:$2,000) 
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Figure 8: Simulation result on campaign without performance goal 
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(a) Spendings over time with different ^ of layers (Budget:$2,000) 
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(b) Spendings over time with different budget ( 9 ^ of layers: 8 . Budget 
spending plan is sacrificed to meet the eCPC goal when traffic volume is 
low.) 
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(c) eCPC over time with different of layers (Budget:$2,000. For base¬ 
line and ^ layers= 2 , we plot their overall eCPC since only limited number 
of time slots have clicks) 
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(d) eCPC over time with different budget of layers: 8 ) 


Figure 9: Simulation result on campaign with performance goal eCPC < $0.8 
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Figure 10: The implementation architecture. 


implement both mechanisms is clear: to achieve extremely 
high throughput (typically billions of impressions per day), 
impression serving boxes do not need to track whether the 
messages have been consumed as long as there is no mes¬ 
sage lost. On the contrary, to make sure the pacing rates 
and quick stop messages are sent and applied to each bidder, 
the controller needs to get responses from all the bidders. 

Although we have implemented a message queue to trans¬ 
fer delivery messages asynchronously, the huge amount of 
delivery information requires some further designs. Batch 
processing and micro-aggregation are the two endeavors to 
further drive efficiency. At impression serving boxes side, 
a batch accumulates hundreds of delivery messages into a 
single request over the wire to reduce network overhead and 
amortize transmission delay. This is especially useful for 
high latency links such as those between data centers. At 
message queue side, a batch groups multiple small I/O op¬ 
erations into a single one to improve efficiency. Another 
practice we have is aggregating the delivery messages under 
certain conditions without information loss, e.g. aggregating 
impressions of the same campaign with the same predicted 
response rate within a certain time period. 

7.2 In-memory Data Source 

The in-memory data source stores the layered delivery 
information of each campaign. The main challenges of im¬ 
plementing such an in-memory storage are: 1) how to avoid 
data loss in case of failure, and 2) how to control memory 
usage. In order to address the potential data loss problem 
in system failures, we employ a snapshot plus commit log 
approach. We observe that the message queue itself can 
be a commit log as long as it honors the order of message 
sequence. With the help of message queue, we can easily 
recover the state using the message sequence number and 
the snapshot data. We address the memory usage issue by 
aggregation. As described in section the pacing rate ad¬ 
justment is based on the delivery information of the most 
recent time slot and therefore only raw messages within con¬ 
figurable recent time need to be stored. Historical data is 
stored in aggregated form to minimize the memory usage. 

8. CONCLUSIONS 

We have presented a general and principled approach as 
well as its implementation in a real DSP system to per¬ 
form smooth pacing control and maximize the campaign 
performance simultaneously. Experimental results showed 
that, compared to state-of-the-art budget pacing method, 
our proposed approach can significantly boost the campaign 
performance and achieve smooth pacing goals. 

Our future work will mainly focus on trying out different 
pacing schemes such as pacing based on performance with 


the help of supply and performance forecasting techniques, 
combining the proposed approach with other control meth¬ 
ods to make it more intelligent and robust, and studying the 
competitions and interactions among multiple campaigns in 
pacing control. 
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